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Abstract 

Dependency Grammar has been used by linguists as the basis of the syntactic components of 
their grammar formalisms. It has also been used in natural langauge parsing. In China, 
attempts have been made to use this grammar formalism to parse Chinese sentences using 
corpus-based techniques. This paper reviews the properties of Dependency Grammar as 
embodied in four axioms for the well-formedness conditions for dependency structures. It is 
shown that allowing multiple governors as done by some followers of this formalism is 
unnecessary. The practice of augmenting Dependency Grammar with functional labels is 
discussed in the light of building functional structures when the sentence is parsed. This will 
also facilitate semantic interpretion. 

1 Introduction 

Dependency Grammar (DG) is a grammatical theory proposed by the French Unguist 
Tesmere.[l] Its formal properties were then studied by Gaifman [2] and his results were 
brought to the attention of the linguistic community by Hayes. [3] Robinson [4] 
considered the possiblity of using this grammar within a transformation-generative 
framework and formulated four axioms for the well-formedness of dependency 
structures. Hudson [5] adopted Dependency Grammar as the basis of the syntactic 
component of his Word Grammar, though he allowed dependency relations outlawed by 
Robinson's axioms. 

Dependency Grammar is concerned directly with individual words. The 'grammar' 
is about what companions a word can have by contracting an asymmetric head-modifier 

(governor-dependent) kind of relation with them. For example, using Hudson's 
convention of letting arrows point from heads to modifiers, we have the following 
Chinese sentence from [6]: 



m mK m A ^ m. n cu) 

The main (or central) element, in Hayes' terminology, of the whole sentence is ^ 
Its immediate dependents are ^ and with ^ in turn having dependents of its 
own. It can be readily seen that this kind of analysis of a sentence is not unlike 
immediate constituency (IC). The main feature of DG that distinguishes it from IC 
analysis is that of the three 'top-level' elements of (1), is different from ^ and ^ 
in the sense that it is the 'head' element. This captures the traditional view that the two 
subordinate elements are 'arguments' of the main 'predicate'. 

Another important observation is that when we use a phrase-structure grammar 
(PSG) to formalize an IC kind of analysis, we obtain for (1), for example, a phrase 
marker (PM), with the node labels left out, like the following from [7]: 




m mK m A ^ m ^-m m cu) 



In this phrase marker, intermediate nodes are required to represent the intermediate 
constituents at various depths of the tree. With a DG analysis, on the other hand, the 
dependency structure (DS) in (1) is equivalent to the following tree representation [6]: 




We have no unnecessary intermediate nodes in (la). This saves space and, more 
importantly, provides a mechanism for the head of the sub-constituents to be identified. 

The correspondence between PM's hke (la) and DG's hke (1) or (lb) is many-to- 
one. [7] gives a procedure to derive from a PM, in which heads of sub-constituents can 



be identified, into a DS. They also give a procedure to derive from a DS like (1) or (lb) 
a 'minimal' PM like (la). 

DG has been used in linguistic analysis [8] and natural language parsing [9, 10] 
for languages with relatively free word order Uke Japanese and Korean. There have also 
been attempts to analyze Chinese sentences using DG [11] and corpus-based statistical 
parsing techniques. [12, 13, 14, 15] Efforts have been made to incorporate syntax- 
motivated rules in the parsing algorithm. [16, 17] Unification-based techniques can also 
be used in DG parsing. [18] 

2 Formal Properties of Dependency Grammar 

While exploring the possibihty of using DG as the base component of a 
transformational-generative formalism, [19] Robinson gives four axioms for the well- 
formedness of dependency structures: [4] 

(a) One and only one element is independent; 

(b) all others depend directly on some element; 

(c) no elements depends directly on more than one other; 

(d) if A depends directly on B and some element C intervenes between them (in 
linear order of string), then C depends directly on A or on B or some other 
intervening element. 

For ease of reference, we shall call (a) to (d) Al to A4. These four axioms are adequate 
for sanctioning dependency structures generated by DG's. Followers of DG in China 
have nevertheless introduced a fifth one: [13] 

A5 An element cannot have dependents lying on the other side of its own 

govemor. 

This addition is formally a corollary of A4, but it makes inspection of possible 
dependency relations easier in many circumstances. 

Robinson [4] intends to use DG to generate a 'structure free' core of natural 
language, so that transformations can apply to this core to yeild the entire language. She 
argues that to account for the generation of 'structure-sensitive' strings hke 'aabaa' using 
a DG formahsm will require the formulation of 'cumbersome' and 'ad hoc' rules. She 
follows Hayes [3] in using rules like the following for a DG: 



Rule (a) allows a governing 'auxilliary alphabet' X to have A, B, Z as dependents. 
The elements A, B, Z will occur in the linear order as given, with the govemor 
situated between H and Y. Rule (b) says that the 'terminal alphabet' corresponding to X 
occurs without any dependents. Rule (c) says that X occurs without any govemor. In 
other words, it is the 'main' or 'central' element of the given 'phrase'. 

Gaifman [2] proves that the class of utterances that can be generated by rules like 
(a), (b) and (c) above is the class of context-free languages. He also proves that every 
DG is 'strongly equivalent' to an IC (read PS for 'phrase-stmcture') grammar in the 
sense that: 

(a) they have the same 'terminal alphabet'; 

(b) for every string over that alphabet, every stmcture attributed by either 
grammar corresponds to a structure attributed by the other. 



(a) X(A,B,C,...,H,*,Y,...,Z) 

(b) X(*) 

(c) *(X) 



(HI) 
(H2) 
(H3) 



An IC (read 'PS') grammar with a property established by Gaifinan is however not 
strongly equivalent to any DG. 

Without going into technical details, Gaifman's results can be appreciated by 
noting that all DG's like (1) or (lb) in the previous section have corresponding PM's hke 
(la), while PM's (not being quahfied by any properties) do not have corresponding DS's. 

3 Analysis of Chinese Sentences 



The comphcations mentioned in the previous section aside, DG as embodied in 
Robinson's axioms can be taken as a good attempt at capturing syntactic structure in 
language. The head-modifier relation is certainly captured. Besides, it should be noted 
that A4 captures the non-crossing constraint well-documented in PS-based grammar 
formalisms. To see this, let us look at the following two DG analyses for the same 
sentence: 




if T 



C2) 



m ^ If sf 7 



C3) 



DS (3) comphes with the axioms of Robinson. It corresponds to the following well- 
formed PM: 




C3a) 



if T 



DS (2) violates the corollary A5 (and hence the axiom A4). It corresponds to an ill- 
formed PM with two instances of cross-over: 



■^i m m m. m ^ m T 



This example involves long-distance dependency. It is obviously outside the core of 
Robinson, [4] who will certainly require the use of transformations to account for this 
sentence. 

Allowing (3) and (3a) and outlawing (2) and (2a) makes perfect sense when one is 
parsing 'surface' strings. Probing into deeper syntactic, functional or semantic 
relationships, it may be reasonable for ^ to be 'governed' by ^ , but in the surface 
structure, there should be nothing wrong in letting the topic ^ be the main element of 
the sentence. It should be noted therefore that, hke PS grammars, DG, at least as 
embodied in Robinson's axioms, are for surface syntactic structures. 

4 Adding a Functional Element 

Using DG to account for only surface syntactic dependencies is in no sense too 
restrictive. It may be noted that in Lexical-Functional Grammar (LFG),[20, 21] context- 
free phrase structure rules are used to account for surface syntactic structures. Deeper 
dependencies are taken care of by a functional element. To account for (3) and (3a), 
LFG uses the following functionally annotated rule: [22] 
(LFl) S' ^ XP S 
(tTOPIC=-i) T=i 

LFG does not bother about which constituent is the syntactic head. In (LFl), S is the 
functional head. When the functional structure is being built up, the topic ^ is moved 
down to a position subordinate to the main predicate representing ^ . Its functional 
relationship with ^ , the embedded predicate, is estabhshed by means of functional 
control. 

In fact, we claim that it is possible to add a functional element to DG. Using the 
set of dependency relation labels in [23], sentence (1) is actually analyzed in [6] as 
follows: 



obj cp 




Clc) 



The labels of the dependency arcs can be seen to have a functional nature. With this 
kind of mechanism, we can also label the arcs in DS (3) to show that ^ bears a 



'comment' relationship to ^ functionally. A functional control mechanism, together with 
lexical information that ^ needs an object, will be able to assign the govemor of the 
'comment' relation its proper functional role in the subordinate clause. 
Another example, from [23], involving control is: 




A functional element is incorporated in the form of labels of dependency relations 
in all major efforts using DG to parse natural languages. When Chinese sentences are 
parsed in [12, 13, 14, 16, 17], the functional labels are there when the parse is produced. 

The DCs of Hayes [3] and Robinson [4] are of a purer syntactic nature. But, 
using LFG as model again, functional aimotations can be added to the Hayesian DG 
rules. (HI) can be annotated as follows: 

(a') X(A(fa), B(fb), *, Z(fz)) (HI') 

where fa, fb, fz are short-hands for LFG-hke functional annotatins fa(X) = A, etc., 
reading A is X's fa. 

Unlike LFG, the syntactic head and the functional head must be the same in 
annotated Hayesian DG rules. Functional annotations can only be attached to non-heads 
in Hayesian DG rules, and instead of adding aimotation to the topic XP as in (LFl), we 
are forced to have: 

(DGl) X(*, Y(COMMENT)) 

But (DGl) and (LFl) differ essentially only in polarity. There should not cause any 
difficulty for the formalism. It can thus be concluded that DG's can be augmented with a 
functional element so that syntactic and functional structures are obtained at the same 
time when a sentence is parsed. 

Availability of the functional structures will also facilitate semantic interpretation 
as in LFG. For (3), the functionally annotated DS: 

comment 

a m^^'^\ m ^ If 7 C3b) 

will yield a semantic structure like: 

commentC^t', '#'('#', '#'...'('#', x))) (3c) 

where X is co-referential with The exact semantic roles of the arguments of the 
predicate '^' can be obtained from lexical information. 



5 Against Multiple Heads 



We have been adhering to Robinson's four axioms so far. It should be noted that Hudson 
[5], in contravention to these axioms, allows multiple governors ('head-sharing' in his 
terminlogy) in the analysis of sentences like: 




I believe hiiti to know Sanskrit (5) 

In this analysis, him is governed by believe and know (as well as by to), violating 
Robinson's axiom A3. This is actually not necessary because underlying semantic 
dependencies can be dealt with by means of functional control as we have done. 

Hudson's head-sharing also gives leads to cross-dependency, in violation of axiom 
A4, in his analysis of the following Dutch sentence from [20]: 




The non-crossing constraint as embodied in axiom A4 is violated as Piet is governed by 
zwemmen while zag, which lies between them, governs Jan. This situation will not arise 
if we stick to the principle that DG only takes care of surface syntactic structures and let 
deeper dependencies be dealt with by means of control. This observation supports our 
standpoint of adhering to Robinson's axioms in our interpretation of DG. 



6 Conclusion 



We have discussed how DG grammar can be augmented with LFG-style functional 
annotation while abiding by a set of axioms that effectively make Dependency Grammar 
capable of deahng with only surface syntactic dependencies. More complex functional 
and semantic dependencies can be dealt with by means of control mechanisms. The 
availabihty of functional structures will also facihtate semantic interpretation. 
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